A Rule Based Syllabification Algorithm for Sinhala

Authors

  • Ruvan Weerasinghe
  • Asanka Wasala
  • Kumudu Gamage
Abstract

This paper presents a study of Sinhala syllable structure and an algorithm for identifying syllables in Sinhala words. After a thorough study of the syllable structure and linguistic rules for syllabification of Sinhala words, and a survey of the relevant literature, a set of rules was identified and implemented as a simple algorithm. The algorithm was tested on 30,000 distinct words obtained from a corpus, and its output was compared with manual syllabifications of the same words. The algorithm achieves 99.95% accuracy.
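The evaluation described in the abstract (algorithm output compared word by word with manual syllabification) can be sketched roughly as below. This is a minimal illustration, not the paper's implementation: `toy_syllabify` is a hypothetical stand-in that splits romanized strings before a consonant followed by a vowel, the sample words are made up for the example, and `implied_errors` assumes the reported figure is word-level accuracy.

```python
from typing import Callable, Dict, List

VOWELS = set("aeiou")  # assumption: romanized input, for illustration only


def toy_syllabify(word: str) -> List[str]:
    """Hypothetical placeholder rule, NOT the rule set from the paper.

    Starts a new syllable at a consonant that is followed by a vowel,
    once the current syllable already contains a vowel.
    """
    syllables: List[str] = []
    start = 0
    seen_vowel = False
    for i, ch in enumerate(word):
        if ch in VOWELS:
            seen_vowel = True
        elif seen_vowel and i + 1 < len(word) and word[i + 1] in VOWELS:
            syllables.append(word[start:i])
            start = i
            seen_vowel = False
    syllables.append(word[start:])
    return syllables


def word_accuracy(syllabify: Callable[[str], List[str]],
                  gold: Dict[str, List[str]]) -> float:
    """Fraction of words whose predicted syllables match the manual ones exactly."""
    if not gold:
        raise ValueError("gold standard must contain at least one word")
    correct = sum(1 for word, syls in gold.items() if syllabify(word) == syls)
    return correct / len(gold)


def implied_errors(total_words: int, accuracy: float) -> int:
    """Number of mis-syllabified words implied by a word-level accuracy."""
    return round(total_words * (1.0 - accuracy))


if __name__ == "__main__":
    # Made-up gold entries; they are not taken from the paper's corpus.
    gold = {"amma": ["am", "ma"], "kalu": ["ka", "lu"]}
    print(word_accuracy(toy_syllabify, gold))
    print(implied_errors(30000, 0.9995))
```

Under that word-level assumption, 99.95% accuracy on 30,000 words corresponds to about 15 words that differ from the manual syllabification.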

Similar resources

Report on Phonetics and Phonology of Sinhala

This report examines the major characteristics of the Sinhala language related to phonetics and phonology. The main topics under study are segmental and supra-segmental sounds in spoken Sinhala. The first part presents the Sinhala phonemic inventory, which describes phonemes with their associated features and the phonotactics of Sinhala. Supra-segmental features like Syllabification, Stress, Pitch and Into...

Festival-si: A Sinhala Text-to-Speech System

This paper brings together the development of the first Text-to-Speech (TTS) system for Sinhala using the Festival framework and practical applications of it. Construction of a diphone database and implementation of the natural language processing modules are described. The paper also presents the development methodology of direct Sinhala Unicode text input by rewriting Letter-to-Sound rules in...

Automatic Segmentation of Separately Pronounced Sinhala Words into Syllables

Aligned corpora are widely used in speech applications such as automatic speech recognition and speech synthesis, as well as in prosodic and phonetic research. Segmentation into syllables can be done manually or automatically. Fully manual phonetic segmentation, however, takes significantly more time and is complicated in practice, because in many cases it requires a large ali...

Are rule-based syllabification methods adequate for languages with low syllabic complexity? the case of Italian

Syllabification information is a valuable component in speech synthesis systems. Linguistic rule-based methods have been assumed to be the best technique for determining the syllabification of unknown words. This has recently been shown to be incorrect for the English language where data-driven algorithms have been shown to outperform rule-based methods. It may be possible, however, that data-d...

A Rule Based Algorithm for Automatic Syllabification of a Word of Bodo Language (ISSN 2319-2720)

Syllabification is the task of identifying the syllables in a word. Correct syllabification rules and algorithms are mainly used in text-to-speech systems to improve the naturalness of the synthesized speech. This paper presents a study of Bodo syllable structure and of the linguistic rules for syllabification. An algorithm has been developed for automatic syllabification of Bo...


Publication date: 2005